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modification  of  the  PAB  software. 


CONTENTS 


EXPERIMENT  I  . 

Methods  . 

Results . . . 

EXPERIMENT  II  . 

Methods  . 

Results  . 

EXPERIMENT  III  . 

Methods  . 

Results  . 

GENERAL  DISCUSSION  . 

REFERENCES  . 

APPENDIX 

Analysis  of  Variance  Tables  . 

FIGURES 

1.  Memory/Visual  Search  Task:  Mean  Response  Latency  as  a  Function 

of  Memory  Set  Size  and  Target  Search  Set  Size  . 

2.  Memoi y/Visual  Search  Task:  Error  Rates  as  a  Function  of  Memory 

Set  Size  and  Target  Search  Set  Size  . 

3.  Add/Subtract  Task:  Mean  Response  Latency  as  a  Function  of  Number 

of  Stimulus  Items  and  the  Stimulus  Durations  . 

4.  Add/Subtract  Task:  Error  Rates  as  a  Function  of  Number  of  Stimulus 

Items  and  the  Stimulus  Durations  . 

5.  Column  Addition  Task:  Mean  Response  Latency  as  a  Function  of  Number 

of  Digits  per  Item  and  Number  of  Items  to  be  Added  . 

b.  Column  Addition  Task:  Error  Rates  as  a  Function  of  Number  of  Digits 
per  Item  and  Number  of  Items  to  be  Added  . 


i.  Regression  Coefficients  for  the  Memory  Search  Task:  Predicting 

Response  Latency  (CRT)  From  Task  Variables  . 

.1  .  Regression  Coefficients  for  the  Serial  Arithmetic  Task:  Predicting 
Response  Latency  (CRT)  From  Task  Variables  . 

i.  Regression  Coefficients  for  the  Addition  Task:  Predicting  Response 
Latency  I  CRT)  From  Task  Variables . 


MKASl'KIM.  ME  NT  A!  WORKLOAD:  A  PERFORMANCE  BATTERY 


The  measurement  (jt  mental  workload  has  been  investigated  extensively  during 
the  past  decade  (see  IvuKeus  i  Kramer  ,  l  98  < ,  to  r  review).  There  is  no  single 
definition  or  workload  that  ha»  bee.,  t’und  to  be  acceptable  and,  therefore,  it  is 
not  surprising  that  there  is  still  considerable  controversy  about  the  appropriate 
methods  to  be  used  in  assessing  wor<J  jad .  despite  Knowles  (1963)  recommendation 
that  efforts  be  made  to  arrive  .t  i  leu  generally  accepted  tests,  this  goal  has  not 
been  achieved . 

The  goals  of  any  such  method  are  to  show  sensitivity  to  changes  in  workload, 
lack  of  intrusion  on  the  performance  ol  the  primary  task,  and  representativeness 
across  job  categories.  Chiles  and  Alluisi  (1979)  have  recommended  the  use  of 
laboratory  tasks  whenever  teasible  to  achieve  these  goals.  They  argued  that 
laboratory  tasks  are  preferable  to  measuring  the  job  itself  (primary  task 
measurement)  because  findings  from  one  primary  task  do  not  generalize  to  other 
dissimilar  jobs.  This  lack  of  representativeness  is  a  major  stumbling  block  in  the 
development  of  workload  measurements  based  on  primary  tasks.  The  use  of  a  single 
representative  task  to  measure  generic  mental  workload  is  probably  an  impossible 
goal  because  of  the  large  domain  of  cognitive  activities  found  in  various  types  of 
mental  work  (LePlat,  1978).  In  contrast  to  primary  task  measurements,  laboratory 
tasks  can  sample  specific  cognitive  activities  and  can  be  combined  with  various 
primary  tasks  to  index  their  workload.  Knowles  (1963)  recommended  that  these  tasks 
should  sample  various  cognitive  functions  found  in  the  workplace  (e.g.,  mental 
arithmetic  is  representative  of  the  general  problem-solving  requirements  of  mental 
work).  Gopher  and  Sanders  (1984)  have  echoed  this  same  recommendation  20  years 
later.  However,  there  still  appears  to  be  no  resolution  in  the  final  choice  of 
measurement . 

The  use  of  more  than  one  type  of  measurement  has  been  recommended  by  various 
investigators  because  of  the  inconsistency  or  insensitivity  of  any  one  measurement 
(LePlat,  1978;  Moray,  1982;  Simmons,  1979).  When  convergent  measurements  from 
various  techniques  can  point  to  specific  tasks  as  having  high  workload,  the  evidence 
is  far  stronger  than  when  only  a  single  measure  is  used. 

Moray  (1982)  noted  the  generally  poor  correlations  among  subjective  opinion 
(operator  reports  of  task  difficulty),  physiological,  and  performance 
measurements.  He  recommended  further  research  attempts  that  assessed  workload  by 
using  several  measurement  techniques  drawn  from  the  subjective,  physiological,  and 
performance  domains.  This  is  consistent  with  Wickens  and  Kramer's  (1985)  position 
that  workload  is  best  viewed  "as  a  multidimensional  construct  that  includes 
behavioral,  physiological,  and  subjective  aspects. "(p.  316). 

The  present  research  was  the  first  step  in  the  development  of  a  test  battery 
that  could  compare  workload  measures  in  these  three  categories.  The  initial  work 
was  the  construction  of  a  computer  program  that  could  present  generally  accepted 
cognitive  tasks  at  a  variety  of  difficulty  (workload)  levels.  These  tasks  were 
then  tested  to  determine  the  influence  of  such  difficulty  manipulations  on 
performance.  The  goaL  of  these  experiments  was  to  determine  a  range  within  which 
performance  was  sensitive  to  changes  in  task  parameters.  On  the  basi9  of  these 
results,  subsequent  experiments  will  compare  a  subset  of  these  task  conditions  to 
measure  subjective  and  physiological  responses,  as  well  as  performance.  This  test 
of  convergent  measures  will  provide  the  correlations  recommended  by  Moray  (1982)  to 


assess  Cask  variables  sensitive  to  all  three  measurement  techniques.  This  report 
was  written  to  make  available  the  test  battery  and  these  initial  performance  data. 

The  test  battery  itself  was  based  on  a  performance  assessment  battery 
developed  by  Walter  Reed  Biomedical  Laboratory  (Thorne,  Censer,  Sing,  6  Hegge , 
1985).  In  its  original  form  the  battery  consists  of  13  performance  tasks;  however, 
the  parameters  (workload)  exist  at  only  one  level  tor  each  task.  In  the  modified 
program,  it  is  possible  to  vary  a  number  of  stimulus  parameters  to  determine  their 
effect  on  the  performance  output. 

In  a  series  of  three  experiments,  a  visual  search  and  two  types  of  mental 
arithmetic  tasks  were  examined  to  determine  which  stimulus  parameters  affected 
performance.  Variation  in  workload  Levels  was  operationally  defined  as  those 
changes  in  task  parameters  that  produce  changes  in  performance. 


EXPERIMENT  1 

Immediate  memory  for  letters  has  been  examined  in  a  variety  of  paradigms  to 
determine  the  effects  of  memory  load  (Sternberg,  1969)  and  visual  search  complexity 
(Neisser,  Novick,  &  Lazar,  1963)  on  processing  time.  Kaplan,  Carvellas,  and  Metlay 
(1966)  combined  both  these  variables  in  a  memory  and  search  task  (MAST).  While 
subjects  completed  a  paper  and  pencil  test,  their  eye  movements  were  measured  to 
determine  the  time  spent  scanning  a  10-item  search  string  of  letters  for  an 
embedded  subset  of  1  to  10  letters.  The  memory  set  was  changed  for  each  trial; 
therefore,  subjects  were  working  on  an  immediate  (short-term  memory)  task.  Scan  time 
increased  linearly  as  a  function  of  memory  set  size. 

Folkard,  Knauth,  Monk,  and  Rutenfranz  (1976)  examined  a  similar  task  in  which 
subjects  searched  a  20-letter  string  for  a  memorized  subset  of  two,  four,  or  six 
items;  however,  the  memory  set  items  were  constant  for  an  entire  block  of  96 
trials.  The  authors  reported  that  the  number  of  lines  searched  varied  as  a 
function  of  the  time  of  day.  When  the  average  performance  at  each  memory  set  size 
is  computed,  the  slope  is  not  linear.  Instead  the  processing  time  per  memory  item 
decreased  as  a  function  of  memory  set  size. 

In  both  these  studies,  subjects  were  allowed  to  return  to  the  memory  set  to 
review  it  after  they  had  begun  scanning  the  search  string.  Kaplan  et  al .  (1966) 
measured  the  time  subjects  spent  reviewing  the  memory  set  and  found  it  to  be  a 
positively  accelerated  function  of  memory  set  size.  Therefore,  the  amount  of 
processing  time  as  a  function  of  memory  load  is  not  linear  when  the  total  time 
spent  on  scanning  the  search  string  and  returning  to  review  the  memory  set  are 
combined.  Instead  Kaplan  et  al .  (1966)  found  that  the  total  processing  time  per 

memory  item  increased  as  memory  set  size  increased .  Response  latency  results  of 
the  present  study  will  be  analyzed  to  determine  whether  the  best  fit  regression 
function  shows  an  increasing  or  decreasing  time  cost  per  memory  set  item. 

Kaplan  et  al  .  (  1966)  described  their  task  as  an  immediate  memory  task.  Such 
tasks  have  been  shown  to  be  sensitive  to  the  amount  of  time  (retention  interval) 
following  the  presentation  of  the  memory  set  before  recall  is  allowed  (Peterson  & 
Peterson,  1.959) .  Keppel  and  Underwood  (1962)  reported  that  tasks  with  variable 
memory  sets  (i.e.,  a  new  memory  s’t  for  each  trial)  were  very  susceptible  to 
forgetting  as  a  function  of  retention  interval  length.  Tiuse  results  were 
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obtained  for  tasks  in  which  the  retention  interval  was  filled  with  a  distracting 
task  to  prevent  rehearsal  of  the  memory  set.  Klatzky  (1984)  has  reviewed  immediate 
memory  processes  and  concluded  that  the  retention  interval  can  serve  as  a  period  of 
consolidation  to  develop  richer  memory  traces  that  resist  forgetting,  if  the 
retention  interval  is  free  of  a  distracting  task. 

The  present  experiment  was  designed  to  replicate  the  conditions  tested  by 
Kaplan  et  al  .  (  1966)  and  Folkard  et  al.  (1976)  using  a  computer  presentation  that 
prevented  the  subject's  looking  back  at  the  memory  set  after  beginning  to  scan  the 
search  string.  In  this  way,  response  latency  would  reflect  the  total  processing 
time  once  the  scan  had  begun.  The  purpose  was  to  oetermine  whether  the  search  for  a 
larger  set  of  letters  was  more  or  less  efficient  (time  per  item)  than  was  the  search 
for  a  smaller  set  of  memorized  items.  Briggs  (1974)  surveyed  the  literature  to 
determine  the  shape  of  the  function  that  predicted  response  latency  from  memory  set 
size.  He  concluded  that  some  data  were  better  fit  by  linear  (equal  time  per  item) 
and  some  by  logarithmic  (less  time  per  item  as  set  size  increases)  functions. 
Therefore,  analyses  of  the  present  data  were  planned  to  test  the  fit  of  both 
functions.  In  addition,  the  retention  interval  was  varied  to  determine  whether 
time  to  consolidate  the  memory  set  would  facilitate  search;  hence  a  new  memory  set 
was  used  cn  each  trial. 

Each  of  these  variables  represents  a  different  type  of  load  in  operational 
tasks.  The  memory  component  is  critical  to  any  coding  or  transcription  task  and  has 
been  found  to  be  a  function  of  both  the  memory  set  size  and  retention  interval 
(e.g.,  Sternberg,  1969;  Wickens ,  1984). 


METHODS 


Subjects .  Twenty  university  students  were  tested.  Twelve  of  these 
participants  were  females  and  eight  were  males.  Eight  were  graduate  students,  while 
the  remaining  students  were  undergraduates.  They  were  found  to  have  normal  or 
corrected  to  normal  visual  acuity.  Subjects  were  randomly  divided  into  two  retention 
interval  groups  with  the  restriction  that  half  of  the  graduate  students  and  half  of 
the  males  were  in  each  group.  One  group  participated  in  the  condition  with  a  3- 
second  retention  interval  between  the  offset  of  the  memory  string  and  the  onset  of 
the  search  string.  The  other  group  had  a  O-second  retention  interval.  Each 
subject  participated  in  a  single  session  during  which  he  or  she  completed  all 
repeated-measures  conditions  in  the  experiment. 

Des ign .  The  task  levels  were  the  following:  (2,  4,  or  6  memory  letters)  x  (9 
or  19  search  letters).  Each  of  these  six  conditions  was  presented  in  two  separate 
blocks  of  11  trials.  The  order  of  presentation  of  the  six  conditions  was 
counterbalanced  to  control  zero-  and  first-order  position  effects.  The  second  block 
ot  six  conditions  was  in  the  reversed  order;  thus  providing  an  ABBA  design. 

Procedure .  Each  subject  was  tested  individually.  The  subject  was  first 
tested  to  determine  that  his/her  vision  was  either  normal  (20/25)  or  corrected  to 
that  level  by  the  subject's  own  glasses  or  contact  lens.  The  program  is  designed 
to  provide  a  menu  and  prompts  that  allow  the  subject  to  serve  as  experimenter  by 
keying  in  the  parameters  for  the  next  block  of  trials;  however,  an  experimenter  was 


always  present  in  the  room  to  answer  questions  and  to  determine  whether  the  •urie~f 

was  following  the  correct  procedure.  The  subject  was  sea  tec  at  t  r-  t  h 

screen.  The  mentor  v  string  spanned  2.6'  to  4  visual  angle,  whil.-  the  t.-.iget  .  r  : ... 

,  o  ,  o 

spanned  r  roiii  t>  to  id./  . 

On  each  trial,  a  fixation  point  was  marked  by  a  i  i.  the  ce:;te. 

screen.  After  a  1.5-second  warning  interval,  the  memory  set  at  two,  tour, 
letters  (centered  horizontally  about  the  fixation  position)  repi.heu  :’u 

After  2  seconds,  the  screen  was  blanked  and  either  the  se  ircii  stri.1 appe.it  • 
immediately  (O-second  retention  group)  or  a  3-second  retention  inter'.  ;!  •  »rr..  ■ 

before  the  search  string  appeared.  The  search  string  contained  either  w '  .  • 

letters.  For  either  search  string  length,  half  of  the  strings  contained  j  1  i 
memory  set  letters.  The  subjects  responded  to  these  sets  bv  pressing  tie  .>  ,  '■ 

key.  On  the  remaining  sets,  it  least  one  of  the  memory  set  letters  was  missing.  n 
these  trials,  t.ie  subject  pressed  the  D  (different)  key. 

Subjects  were  instructed  to  work  as  quickly  as  possible  without  making 
errors.  The  subject  first  completed  a  block  of  practice  trials  ior  each  of  the  six 
conditions.  There  were  five  trials  in  each  practice  block.  If  there  were  no 
questions,  the  subject  ttien  began  the  data  trials.  After  completing  one  block  of  11 
trials  for  each  of  the  six  conditions  in  the  designated  order,  the  subject  nad  a 
5-minute  break  and  then  completed  one  more  block  of  each  condition  in  the  reverse 
order.  Summary  response  time  and  accuracy  data  were  displayed  on  the  screen  at  the 
end  of  each  block  of  trials.  The  subjects  were  encouraged  to  compete  with  their 
own  scores  to  improve  their  performance. 


Equ  i  pment  .  An  Apple-'IIe  system  with  two  disk,  drives,  128  K  memory,  and  a 
<  monochromatic  screen  was  used  to  present  the  stimulus  information  and  register  the 

subject's  response.  A  timer  board  from  California  Computer  Systems  (Model  <l  7220A) 
was  inserted  in  slot  '*2  to  time  the  duration  of  the  response  intervals.  The  data 
were  output  to  an  Apple" Imagewriter  printer  for  later  analysis.  The  parameters  and 
subject  identification  code  were  printed  for  each  block  of  trials.  The  accuracy  and 
response  time  for  each  trial  were  recorded  as  well  as  summary  data  for  each  11- 
trial  block. 

The  test  battery  itself  was  modified  from  the  Walter  Reed  Performance 

\ssessment  Battery  (Thorne  et  al  .  ,  1985).  This  program  is  written  in  Applesoft" 

BASIC  under  DOS" 3.3  and  is  available  in  both  an  uncompiled  and  compiled  version. 
The  modified  test  battery  provided  the  opportunity  to  vary  (via  menu)  the  following 
parameters  in  the  memory  search  task  used  in  the  present  study:  the  number  of 

trials  in  any  block,  the  number  of  letters  in  the  memory  set,  retention  interval 

between  the  nem./ry  set  and  the  search  string,  and  the  number  of  letters  in  the 

search  stri  r.  g  . 


RESULTS 


response  times  to  correct  trials  in  each  block 
variance.  A  mixed-factors  design  was  used 

tit.  grouping  variable  (retention  interval)  and 
■re  set  size,  search  set  size,  and  trial  block). 

>•  cnivsis  of  variance  is  included  in  tile  \ppendix. 
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dramatically  dir  f  erent  t»»r  the  two  vai  •  ■'.r.nc  site,  .a  vs.  !  1 ) .  It  is  prt  ;r.  .mre 
to  Interpret  the  three-way  interact  l.  ,i  oi  memo  r  y  set  size,  search  set  size,  and 
retention  interval;  however,  the  means  ot  this  interact  ion  are  graphed  in  Figure  . 

When  the  graphs  of  the  two  dependent  va  *•  i  a  h  1  v  s  are  considered  together,  it 
appears  that  a  speed  accuracy  trade-ot'  has  occurred  between  retention  intervals. 
At  the  shorter  (O-second)  interval,  response  limes  are  less  atfected  by  memory  set 
size,  while  error  rates  increase  sharply.  At  the  longer  (3-second)  interval,  the 
response  time  slope  as  a  function  of  memory  set  size  is  steeper;  however,  error 
rates  increased  less  than  for  the  shorter  retention  interval.  There  is  no  apparent 
reason  in  the  experimental  procedure  for  this  trade-off;  therefore,  this  phenomenon 
may  merit  turther  study. 

In  an  exploratory  regression  analysis,  the  data  for  latencies  of  correct 
responses  were  fit  to  linear,  quadratic,  and  logarithmic  models  for  each  retention 
interval  group.  A  "best"  model  was  found  by  allowing  variables  to  enter  the 
equation  in  a  forward  stepwise  regression  (SPSS,  1970)  until  the  addition  of  m^re 
variables  did  not  significantly  increase  the  proportion  of  variance  explained  (R  ). 
The  best  linear  model  entered  both  memory  set  size  and  search  set  size  for  each 
group.  Both  the  quadratic  and  the  logarithmic  model  provide  a  similar  fit,  but  may 
be  less  straightforward  to  interpret.  The  regression  equations  for  each  model  are 
listed  in  Table  1.  None  of  these  models  has  been  cross-validated;  however,  the 
percent  variance  explained  is  in  the  range  Nunnally  (1973)  describes  as  a 
moderately  ( R  =  .40-. 50)  or  a  very  (R  =  .50-. 60)  satisfactory  predictor. 

None  of  these  models  fits  the  RI  =  0  data  as  well  as  it  fits  the  RI  =  3  data. 
The  linear  model  fits  as  well  as  the  more  complex  models  for  each  dependent 
variable.  However,  it  should  be  noted  that,  ior  the  RI  =  3  group,  the  quadratic 
model  fits  the  response  latency  data  as  well  with  one  independent  variable 
(MKM*SLAK)  as  the  linear  model  does  with  two  independent  variables. 
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These  results  are  consistent  with  the  expected  results  with  regards  to  memory 
set  size  and  search  string  size.  Larger  values  oi  ear::  variable  resulted  in 
increase!  response  time.  However,  the  regression  anal  vs i s  provided  no  clarification 
of  whether  a  linear  or  a  log  linear  model  best  di st  r i  K-s  t  he  performance  latency 
data  !  Briggs ,  1474).  In  the  aggregate,  the  sub  le.-t*  with  t  tu.  longer  retention 
interval  had  lower  error  rates.  However,  it  tt  highest  t.<t,  ,  r  v  leads  ,  the  •■rror 
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3-second  group  than  those  for  the  O-second  retest  i  <  r  -gr*  op . 
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sual  search  task:  Error  rates  as  a  function  of  memory  set  size 


Table  1 


Regression  Coefficients  for  the  Memory  Search  Task: 
Predicting  Response  Latency  (CRT)  From  Task  Variables 


MODEL  1:  LINEAR  (Possible  independent  variables  are  memory,  search,  and  block.) 

Retention  Interval 


0  seconds 

3  seconds 

MEMORY 

.24 

.51 

SEARCH 

.15 

.20 

CONSTANT 

.09 

-1.11 

R2 

.25 

.44 

MODEL  2:  QUADRATIC  (Possible  independent  variables  are  memory,  search, 

memory*search ,  memory  ,  search  ,  block.) 


Retention  Interval 


0  seconds 

3  seconds 

SEARCH2 

.003 

MEMORY* SEARCH 

.017 

.042 

CONSTANT 

1.580 

1.440 

R2 

.25 

.44 

MODEL  3:  LOGARITHMIC 

(Possible  independent 

variables  are  log  (memory) 

log? ( search)  ,  block.) 

Retention  Interval 

2 

0  seconds 

3  seconds 

LOG (MEMORY) 

.638 

1.282 

L0G( SEARCH) 

1 .410 

1.896 

CONSTANT 

-3.230 

-5.640 

EXPERIMENT  II 


Mental  arithmetic  has  been  used  to  assess  workload  in  several  previous 
batteries  (see  Ogden  et  al  •  ,  1979,  for  review).  Although  simple  arithmetic  is  a 
skill  mastered  t>y  most  iLterate  adults,  the  method  by  which  we  compute  is  not  w<  11- 
understood.  Groen  and  Parkman  (1972)  proposed  a  counting  model  to  explain  response 
latency  to  simple  addition  problems.  However,  Ashcroft  and  Battaglia  (1978) 
reported  lesearch  that  more  closely  matched  a  memory  search  and  retrieval 
explanation.  They  found  that  subjects  could  determine  the  veracity  of  a  sum  in 
approximately  900  msec,  while  the  size  or  the  actual  digits  to  be  added  contributed 
Little  (less  than  2  msec/digit)  to  the  total  reaction  times  tor  adult  subjects. 
They  did  report  that  response  times  increased  the  square  of  the  digit  sum  and 
hence  that  the  total  reaction  time  would  increase  nonlinearly  with  the  magnitude  of 
the  problem. 

The  mental  arithmetic  task  used  in  the  present  study  required  that  the 
subject  calculate  a  running  total  of  addition  and  subtraction  operations  on  a 
series  of  single  digit  numbers.  This  task  varied  the  magnitude  of  the  problem  by 
altering  the  number  of  digits  in  the  series.  A  linear  increase  in  response  time 
would  be  predicted  if  including  more  computations  in  a  single  problem  merely 
multiplied  the  number  of  operations  that  had  been  necessary  to  compute  a  two- 
element  problem.  However,  if  computation  time  was  increased  as  an  exponential 
scale  of  task  difficulty,  then  the  resulting  response  latencies  would  increase  as 
the  quadratic  function  described  by  Ashcroft  and  Battaglia  (1978). 

In  addition,  the  stimulus  duration  of  each  digit  in  the  problem  was  varied  to 
determine  whether  computation  on  the  subtotal  could  be  conducted  during  the  course 
of  the  series  presentation.  Gopher  and  Sanders  (1984)  have  discussed  the  question 
of  operation  interruption  in  the  context  of  dual-task  time-sharing.  They  have 
noted  that  at  some  stages,  two  tasks  cannot  be  time-shared  because  one  stage  will 
essentially  capture  processing  capacity  for  the  duration  of  its  execution.  In  the 
analogous  aspect  of  the  current  task,  the  question  to  be  tested  was  whether 
computation  on  the  partial  sum  could  be  accomplished  efficiently  during  the  period 
prior  to  the  presentation  of  the  next  digit  in  the  problem.  If  the  subject  could 
perfectly  time-share  the  processing  of  the  current  subtotal  and  the  processing  of 
the  next  incoming  stimulus  digit,  then  reaction  time  (measured  from  the  offset  of 
the  final  digit)  would  show  a  decrease  equal  to  the  extra  processing  time  allowed 
by  longer  stimulus  durations.  Stimulus  durations  were  made  short  enough  to 
interrupt  the  time  required  to  complete  a  two-digit  computation  in  order  to  require 
interruption  of  the  computation  by  the  presentation  of  the  next  digit  in  the 
problem . 


METHODS 


Sub  jec  ts . 
were  females 
cor  rec  ted  to 
sess  ion  dun  ng 


Twenty  university  subjects  were  tested.  Ten  of  these  participants 
and  ten  were  males.  Half  were  graduate  students.  Each  had  normal  or 
normal  visual  acuity.  Each  subject  participated  in  an  individual 
which  he  or  she  completed  all  conditions  of  the  experiment. 
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Design.  The  task  levels  were  the  following:  <2,  3,  or  4  numbers  to  be 

totaled)  x  (250,  400,  or  650  msec  per  number  an  :  operant).  bach  of  these  nine 
conditions  was  presented  in  two  separate  blocks  of  10  trials.  The  order  of 
presentation  of  the  nine  conditions  was  counterbalanced  as  described  in  Experiment 

I . 

Procedure.  The  procedure  was  identical  to  that:  used  in  the  first  experiment 
except  for  the  subject's  task.  In  this  experiment  the  subject's  task  was  to 
perform  a  serial  addition/subtraction  task.  Randomly  selected  digits  were  presented 
one  at  a  time  at  the  fixation  point.  In  each  trial,  two  digits  were  displayed  in 
sequence,  followed  by  a  plus  or  a  minus  sign.  If  the  trial  was  in  a  three-item 
condition,  a  third  digit  and  another  plus  or  minus  sign  followed.  If  it  was  a 
four-digit  trial,  a  fourth  digit  and  sign  followed.  The  subject's  task  was  to 
perform  this  series  of  computations  and  to  enter  the  units  digit  of  the  total 
(e.g.,  8,  6,  +  yields  14,  so  enter  4).  If  a  negative  answer  resulted,  the  subject 
added  10  and  entered  the  resulting  units  digit.  The  duration  of  each  digit  and  each 
operant  sign  was  constant  within  a  block  of  10  trials.  Between  trials,  this 
duration  varied  as  250,  400,  and  650  msec. 

The  modified  test  battery  provided  the  opportunity  to  vary  (via  menu)  the 
following  parameters  in  this  serial  arithmetic  task:  the  number  of  trials  in  each 
block,  the  number  of  digits,  and  the  duration  of  the  digits  and  the  arithmetic 
operants . 


RESULTS 

The  mean  response  times  to  correct  trials  in  each  block  were  analyzed  in  an 
analysis  of  variance.  A  mixed-factors  design  was  used  to  determine  the 
significance  of  the  three  repeated  measures  (number  of  items,  duration  of  items, 

and  trial  blocks).  The  mean  response  times  as  a  function  of  number  of  items  and 

item  duration  are  graphed  in  Figure  3.  The  effect  of  number  of  items  was 

significant,  F(2,38)  =  36.38,  p  <  .001.  Practice  (blocks),  F(l,19)  =  26.35,  p  < 
.001,  as  well  as  the  interaction  of  Item  x  Block,  F(2,38)  =  4.19,  p  <  .05,  were 
found  to  be  significant.  Response  speed  improved  by  450  msec  between  blocks  as  a 
function  of  practice;  however,  the  conditions  with  more  items  improved  more.  When 
the  response  latency  is  considered  as  cost  per  digit,  the  information  processing 
rate  is  virtually  constant  in  all  Block  1  conditions  (595,  583,  555  msec  for  two, 
three,  and  four  digits,  respectively.  Furthermore,  this  value  is  decreased  by  a 
constant  in  the  second  Block  to  449  ,  437  ,  and  397  msec  for  two,  three,  and  four 
digits,  respectively.  Item  duration  did  not  affect  response  latency  (see  Figure  3). 

Error  rates  increased  as  the  number  of  stimulus  items  increased.  This  effect 

appears  to  be  stronger  for  the  shorter  duration  stimuli.  The  error  rates  as  a 

function  of  number  and  duration  of  items  were  graphed  in  Figure  4.  In  addition, 
the  effect  of  practice  improved  the  error  rate  from  12.6  percent  to  9  percent 

between  Blocks  1  and  2.  This  practice  effect  did  not  appear  to  interact  with 
either  number  of  items  or  duration. 

While  both  latency  and  accuracy  showed  a  practice  effect  and  reflected 

variation  in  number  of  items,  only  error  rate  was  affected  by  changes  in  item 


250  DURRTION 
400  DURRTION 
650  DURRTION 


Figure  3.  Add/subtract  task:  Mean  response  latency  as  a  function  of  numbe 

stimulus  items  and  the  stimulus  durations. 
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and  logarithmic  models.  The  best  linear  model  t  ntered  bo"  ii  item  aid  block.  Both 
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is  consistent  with  tne  following  interpret,  i  in:  Sip  ;ect<-  wa  i  *  cd  cat  i  !  the  ear  i  re 
List  of  digits  was  complete  net  ore  tie/  uegan  to  <  output  e  to.-  -,i;ir .  rtocat.s.-  t  he 
shorter  presentation  rate  (250  msec)  allowed  them  less  time  to  consol idate  each  item 

(resolving  the  physical  image  and  perhaps  attaching  a  verba l  name ) ,  error  rates 

were  higher  for  shorter  stimulus  present  Lion  dm  at  ion:,. 


EXPEkJMENT  III 

In  the  third  experiment  subjects  were  asked  to  respond  to  at.  a.  ithmotic  task 
in  which  the  stimuli  were  present  for  the  entire  computation.  'ms  wav,  a 

memory  component  tor  the  stimulus  numbers  was  nut:  n  ess.,.  .• ,  »•  •.?  on l  v  a 

computational  memory  component  (for  carrying  values  treat,  column  to  column.'.  The 

task  conditions  varied  in  difficulty  as  a  function  oi  the  size  "units,  tens, 
hundreds)  and  the  number  of  items  to  be  added  Ashcraft  and  Stnzvk  reported 

that  increasing  problem  size  from  one-digit  to  two-digit  addends  increased  response 

time  sharply  irom  959  to  1,378  msec.  They  suggested  that  response  latenev  should 
reflect  the  difficulty  in  the  summing  of  each  column  plus  a  -  .nst.ir.t  imount  of  time 
for  each  carry  operation.  In  the  present  experiment,  both  the  number  of  digits  per 
addend  and  the  number  of  addends  in  the  problem  were  varied  re  determine  whether 
the  increase  in  problem  difficulty  from  these  two  sources  produced  euual  increments 
in  response  latency. 


METHODS 


Subjects.  Twenty  uni  ersitv  subjects  were  tested.  :  •_  o  ct 
were  females  and  ten  were  males.  Halt  were  graduate  ,tod>  i  t 
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individual  session  during  which  he  or  she  completed  ill 

experiment  . 
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Design.  The  task  levels  were  the  follow...,. 
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counterbalanced  as  described  in  Experiment  i. 
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MODE!  1  : 

LINEAR  (Possible  independent  variables 

.iti  dur  1 1  i  -jm  ,  i  t  err. , 

b lock  . ) 

ITEM 

.4  3L 

BLOCK 

-  .  4  b  4 

MODEL  3  : 

LOGARITHMIC  (Possible  independent 

variables  are  log  (duration) 

log  f  <  i  tern  )  , 

A. 

block.) 

/ 

LOC( ITEM) 

.860 

BLOCK 

.454 

CONSTANT 

.858 

Procedure.  The  procedure  was  identical  to  that  used  in  the  first  experiment 
except  for  the  subject's  task.  In  this  experiment  the  subject's  task  was  to 
perform  an  addition  task.  Numbers  were  presented  simultaneously  in  column  format  in 
the  center  of  the  screen.  In  each  block  of  trials,  either  two  or  three  numbers  were 
to  be  added.  In  each  block  the  digits  in  each  number  were  fixed  at  one,  two,  or 
three.  In  each  trial,  the  numbers  were  displayed  until  the  subject  entered  the 
rust  digit  of  the  total;  then  the  column  of  numbers  disappeared. 

Tiie  modified  test  battery  provided  the  opportunity  to  vary  (via  menu)  the 
tollowing  parameters  in  this  addition  task:  the  number  of  trials  in  each  block,  the 
number  of  digits  in  each  item,  and  the  number  of  items  to  be  added. 


RESULTS 

The  mean  response  times  to  correct  trials  in  each  block  were  analyzed  in  an 
analysis  of  variance.  A  repeated-measures  design  was  used  to  test  the  effect  of 
number  of  items  to  be  added,  number  of  digits  in  each  item,  and  trial  block.  The 
effect  of  blocks  was  not  found  to  be  significant  in  any  main  effect  or  interaction; 
therefore,  the  data  were  graphed  for  the  average  of  the  two  blocks  of  trials  in 
each  condition. 

The  mean  response  times  for  conditions  were  graphed  in  Figure  5.  The  effects 
of  items,  F(l,19)  =  119.4,  digits,  F(2,38)  =  187.4,  and  their  interaction  (Item  x 
Digit),  F(2,38)  =  61.8,  were  all  significant  at  p  <  .001. 

Error  rates  increased  as  the  number  of  digits  in  each  addend  increased  (1  vs. 
2  vs.  3).  While  the  error  rates  for  adding  two  or  three  one-digit  numbers  were 
virtually  identical,  error  rates  for  larger  addends  (two-  and  three-digit  numbers) 
were  higher  for  three-item  problems  than  for  two-item  problems.  The  error  rates 
were  graphed  in  Figure  6. 

When  the  graphs  of  the  two  dependent  variables  are  considered  together,  both 
show  decrements  in  performance  as  the  number  of  items  and  digits  increased. 

The  data  for  latencies  of  correct  responses  were  fit  to  line  ir ,  quadratic, 
and  logarithmic  models  for  each  retention  interval  group.  The  be ,t  linear  model 
entered  both  number  of  items  and  number  of  digits.  The  quadratic  and  the 
logarithmic  model  provide  a  similar  fit  by  adding  different  functions  of  the  same 
two  independent  variables.  The  regression  equations  for  each  model  are  listed  in 
Table  3. 

All  three  models  do  a  very  good  job  of  fitting  the  data.  The  best  model  is 
the  quadratic  model,  which  fits  the  data  with  one  independent  variable  (ITEM  * 
DIGIT)  and  explains  the  greatest  portion  of  the  variance  (RA  =  .74). 

Both  error  rates  and  response  latencies  were  affected  in  a  very  systematic 
way  by  the  increasing  load  of  computations.  Increasing  the  number  of  items  to  be 
added  or  the  number  of  digits  within  each  item  increased  latency  by  approximately 
the  same  amount  (4.70  and  4.72  seconds  for  each  unit  increase).  The  cost  to  the 
error  rate  of  increasing  the  size  of  the  computation  was  about  8  percent  per  unit. 
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Figure  5.  Column  addition  task:  Mean  response  latency  as  a  function  of  number  of 
digits  per  item  and  number  of  items  to  be  added. 
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Table  3 


Regression  Coefficients  for  the  -vie it  ion 
Predicting  Response  Latency  (CRT)  From  TasK  Variables 


MODEL  1:  LINEAR  (Possible  independent  variables  are  digit,  item,  block.) 


DIGIT 

4.70 

ITEM 

4.72 

CONSTANT 

-14.52 

R2 

.69 

MODEL,,  2:  QUADRATIC  (Possible  independent  variables  are  digit,  item,  d  igi  t  *  i  tem , 
digit  ,  item  ,  block.) 


ITEM*DIGIT 


3.42 


R 


2 


.74 


MODEL  3:  LOGARITHMIC  (Possible  independent  variables  are  log-(digit),  log  (item), 
block.) 


LOG( DIGIT) 
LOG( ITEM) 
CONSTANT 


5.74 

8.07 

-8.69 


MINERAL  DISCUSSION 


i  -<  i  t  !*•  >;  ti'ii'  experiments,  the  subjects'  tasks  were  varied  to  alter 

:  a.  •  :  ;  1 1  :  "  or  t  risk.  IVr :  nrmance  measures  of  accuracy  arid  speed  were 

;  i,.  dererniine  the  i  ", !  !  ue  nee  of  oxoerimpnter-def  ined  difficult'.’  (  j  .  ,  task 

.  :  v :  •  too  sea *  s  performance.  All  three  tasks  required  a  me.norv  load 

;  c  •  w )  .>{  the  tasks  required  a  computational  load.  The  purpose  or  the  research 
was  to  establish  the  sensitivity  of  performance  measures  to  variations  in  task 

il  i  I  t :  '-.if.  The  pattern  of  performance  measures  was  examined  to  determine  the 

i  ;  !  *  •  c  t  1  the  various  i  eve  Is  of  memory  and  computational  load. 

In  r  r-in.te  of  values  tested,  all  task  parameters,  except  stimulus  dura  t  i  or, 
t  he  serial  aritnm.etic  (  add  i  t  ion/ subt  rac  t  ion )  task  (-experiment  II),  if  fee  ted 
per  r  ormaace ,  showing  a  systematic  increase  in  latency  and  error  rates  as  a 

rue  lion  of  task  difficu'tv.  In  this  one  exception,  latency  was  no'  sensitive  to 
stimulus  darn  ion  even  over  i  large  range  of  values  ( 7  50  to  I  ,9Vi  msec  tor  the 

smallest  number  of  computat ions/problem;  and  1  ,750  to  4 ,550  msec  for  the  largest 

number  of  computat ions/problem) .  It  appears  either  (1)  subjects  were  doing  no 
computations  during  the  serial  presentation  of  the  items  to  be  added,  or  12) 

subjects  had  to  recalculate  some  portion  of  the  computations  after  the  final 
;  vr.jt  i  -mi  ( beginning  ■  >  f  the  response  interval).  In  either  case,  the  net  result  was 
that  tin-  response  latency  was  unaffected  by  the  amount  of  time  that  the  stimulus 

items  had  been  presented.  However ,  error  rates  were  greater  when  the  subject  had 
hid  less  time  to  view  each  item  and  operant.  The  improvement  in  accuracy  as  a 
function  of  viewing  time  may  be  a  function  of  greater  time  to  analyze  and  label 

t  tie  stimulus  before  having  to  process  the  next  incoming  item. 

fi;e  results  of  the  memory/ v  i  sua  1  search  task  ("Experiment  I)  were  complex 

be  *.mse  of  the  possible  s) eed/accuracy  trade-off  between  the  two  retention 

interval  groups.  The  most  parsimonious  explanation  of  the  differences  in 
performance  between  the  two  groups  is  that  the  o-second  retention  interval  group 
adopted  a  higher  accuracy  criterion  at  the  cost  of  responce  speed.  However, 

further  examination  of  the  patterns  of  responding  lead  to  a  more  interesting 

(though  post  hoc)  hypothesis.  Error  rates  were  comparable  for  the  two  groups  at  the 
longest  (h)  memory  set  size;  however,  response  latencies  were  slower  for  the  group 
with  the  longer  retention  interval.  Consolidation  of  the  memory  set  items  into  i 
different  ( perhaps  verbal)  code  during  the  extended  retention  interval  would 
•sec  ess  i  t  ate  a  different  retrieval  strategy  and  i  longer  search  to  mat  eh  time 

!  ■’  1  ,t  /r.  v  ,  1  )  . 

.  ttenev  in  tie-  '•>.  1  irar,  addition  task  (Experiment  HI)  increased  as  ,  nmrt  i  ,-n 


’  c  •  .,»  a  I  content  in  the  stimulus;  i.e.,  adding,  t  hr«  .•  .’-digit  items  was  p,  t  i  me- 
■  o-i  i is  adding  two  3-digit  numbers .  This  can  he  seen  in  either  :  ;  e  I  i  n-ar  -r 
trie  piadratjp  regression  model.  Either  model  explains  i  sul  :  i<*  ienf  port,  i  n  ■>;  the 
nr;  ist  (  k  “  -  .  hf)  md  .  7  ,  respect  i  ve  1  v  i  to  he  ffi'i-.l  i'Ted  a  ■  a  t  i  -  Me  t  o  ;  ■red  i  ,  •  t  i  ■  r 


U.-g'.ii  iv,  '.’•*/  5  )  .  However,  di  f  1  i-ri-nt  1  at  i  r-.g  bet  ween  the  line  ir  i:';d  ’  ‘-e  pi.id  r.a  t  i  > 

pr  ■  s:  i  •  M  r  mode  1  s  is  •  t  po  s  s  i  b  1  e  t  rom  these  da  t  t  .  i  'it  urc  . . .  ir.  b  w  ml  i  b.  ■  ut-e.ie  i 

f  ‘  -  f  r  a  i  so  whet  b»-r  it  is  fin-  t  ■  > 1  .  t  1  number  n!  ir  i  t  •  •  .p.-r  <  •  i  •  •  -  f  1  e 

j-iu  :  r  it  i  •,  ■  1  e  1  )  1 1  r  -  1 1  >•  independent  i  r.t  1  in-n.-es  a  i  t  •  •  -  <•:•.!  d  i  ■  -  ’  ■  '  '  1  •  1  '  -  ir 

:•  •  •••  ,  •  r  • ,  i :  lit!  i. re-  ponse  I  1 1  -"y  v .  Idle  e"!  t  i  ii"!  ■■  :  t  ;•••  !  ;  «  ir  ™  •  :•  .  •  r  >  ■ 

. .  '  •  :  ,r  d  i i  r  s  md  items,  re :  j  ;•  s  r  >  ; i ;  -  '  t  ;  r  ‘  - >c ■■ ! 

-  •  ■  '.■  !  r  i  r-.  i  t  be  ;>r  :d  ue  t  .  i  '  em  *  d  l  e  \  t  .  Is;  •  tv  ' '  i ;  i  ‘  i  -  ■  ■  ■  :  n  be 

•  •  •  r  ■  ’  i "  e  . f  ;  v  •  ,  pr  e  i  i  •  t  resportse  1  .it  e:ie  v  . 


-j-y. 


s'  s 


In  conclusion,  performance  was  found  to  be  a  sensitive  indicator  of  changes 
in  the  task  parameters  for  the  three  tasks  tested.  The  test  battery  is  an  easy, 
economical  method  of  administering  variations  of  these  tasks.  Future  research 
measuring  performance,  evoked  potentials,  and  subjective  responses  is  planned  using 
this  battery  of  three  tasks. 


U 


>  ■ 


•./.V.v.  /,  -■ 

1 


.V.V.  V.  ,•  C. 
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EXPERIMENT  I:  ANALYSIS  OP  VARIANCE  TABLE 


BETWEEN  S 

WI  rn I N  s 


SOL'KCE 

SS 

RETENTION  INTER 

24  .41 
373.24 

BLOCKS 

.03 

RET  I  N’T  X  BLOCK 

.09 
11  .87 

MEMORY  SET  SIZE 

194.38 

RET  INT  X  MEMORY 

4 .70 
17.33 

SEARCH  SIZE 

87  .32 

RET  INT  X  SEARCH 

9.19 

61 .82 

BLOCK  X  MEMORY 

.49 

RET  X  BL  X  MEM 

.01 

6.37 

BLOCK  X  SEARCH 

1.70 

RET  X  BL  X  SEAR 

.51 

12.08 

MEM  X  SEARCH 

6.40 

RET  X  MEM  X  SEAR 

1.13 

5.52 

BL  X  MEM  X  SEAR 

.26 

RET  X  B  X  M  X  S 

.11 

7.12 

TOTAL 

826.07 

MS 

df 

L 

24.41 

1 

1.18 

20.74 

18 

.03 

1 

.0  * 

.09 

1 

.13 

.66 

18 

194  .36 

1 

201 .83 

4.70 

1 

4.9 

.96 

18 

43  .66 

2 

25.43 

4.59 

2 

2.68 

1.72 

36 

.49 

1 

1  .39 

.01 

1 

.01 

.35 

18 

CO 

Ul 

2 

2.54 

.26 

2 

.76 

.34 

36 

3.20 

2 

20.83 

.56 

2 

3.7 

.15 

36 

.13 

2 

.66 

.06 

2 

.28 

.20 

36 
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EXPERIMENT  II:  ANALYSIS  OF  VARIANCE  TABLE 


SOURCE 

SS 

MS 

df 

F 

BLOCKS 

18.53 

18.50 

1 

26.43 

13.36 

.70 

19 

DURATION 

.13 

.06 

2 

.26 

8.92 

.23 

38 

ITEMS 

45.02 

22.51 

2 

36.31 

23.51 

.62 

38 

BLOCK  X  DUR 

1.30 

.65 

2 

1.51 

16.47 

.43 

38 

BLOCK  X  ITEMS 

1.69 

.84 

2 

4.20 

7.67 

.20 

38 

DUR  X  ITEMS 

1 .32 

.33 

4 

2.36 

10.52 

.14 

76 

BL  X  DUR  X  ITEMS 

.76 

.19 

4 

.79 

18.29 

.24 

76 

BET  S  ERR 

132.76 

6.99 

19 

TOTAL 

300.25 

359 

Grosso 


PRWWW 


EXPERIMENT  III:  ANALYSIS  OF  VARIANCE  TABLE 

i 

i 


SOURCE 

SS 

MS 

df 

F 

BLOCKS 

3.39 

3.39 

1 

1  .63 

39.46 

2.08 

19 

ITEMS 

1338.59 

1338.59 

1 

119  .42 

212.98 

11.21 

19 

DIGITS 

3556.84 

1778.42 

2 

187  .42 

360.58 

9.49 

38 

BLOCK  X  ITEMS 

1.76 

1.76 

1 

.71 

46.81 

2.46 

19 

BLOCK  X  DIGITS 

1.95 

.98 

2 

.48 

77.32 

2.03 

38 

ITEM  X  DIGIT 

504.47 

252.23 

2 

61  .84 

155.00 

4.08 

38 

BL  X  ITEM  X  DIG 

4.35 

2 .18 

2 

1 

82.50 

2.17 

38 

BETWEEN  S  ERR 

721.79 

37.99 

19 

TOTAL 


7107  .79 
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